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Abstract 

Multivariate stochastic volatility models with skew distributions are proposed. Exploit- 
ing Cholesky stochastic volatility modeling, univariate stochastic volatility processes with 
leverage effect and generalized hyperbolic skew t-distributions are embedded to multivari- 
ate analysis with time-varying correlations. Bayesian prior works allow this approach to 
provide parsimonious skew structure and to easily scale up for high-dimensional problem. 
Analyses of daily stock returns are illustrated. Empirical results show that the time-varying 
correlations and the sparse skew structure contribute to improved prediction performance 
and VaR forecasts. 
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1 Introduction 

Multivariate volatility models have attracted attention for their adaptability of variances and 
correlations to time series dynamics in financial econometrics in particular. A number of works 
discuss mul tivariate generalized autoregressive conditional heteroscedasticity (GARCH) mod- 
els (see e.g 



Chib et al. 



Bauwens et al 
2005 



Asai et al. 



20060 and multivariate stochas tic volatility (MSV) models (see e.g.. 



20061: 



Gourieroux et al. 



20091) . Meanwhile, apart from symmet- 



ric distribution, several studies have addressed skew and heavy-tail properties in multivariate 
financ ial time series; their modeling strategies for return dist r ibutions use skew norma l distribu- 



tions (lAzzalini and Vallelll996l: lAzzalini and Capitanio 



1999 . 



20031: iGupta et al. 



2004), a skew- 
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Cauchy distribution (lArnold and Beaveii I2OOOI) . skew-elliptical distributio ns (IBranco and Dev 



2001 



(See 



Sahu et 



Azzalini 



al 



20031), and a finite mixture of skew-normal distributions (ICabral et al 



2005 



20121) . 



for a survey and discussion of skew distributions for both univariate and 



multivariate case s). In this context, mu l tivaria te G ARCH models wi th skew distributions have 



been proposed by 



Bauwens and Laurend (12005b and 



AasetaL 



(120061) . 



In the literature, little has been discussed about MSV models with skew error distributions. 



Zhang et al.l (1201 ih develop a multivariate analysis of the generalized hyperbolic (GH) distri- 



bution with time-varying parameters driven by the score of the observation d e nsity, based on 
the ge neral ized autoregressive score (G AS) model (see also. 



Creal et al 



201lh . 



Ishihara et al 



2011) and 



Ishihara and Omoril (120120 provide MSV models with a leverage effect, a stylized 



fact of financial returns, which induces skew conditional return distribution. In contrast, the 
current paper proposes MSV models with leverage effect, where structural errors follow the 
GH skew t-distribution. This is a natur al extension of standard univaria t e stochastic volatility 
proce sses with skew distributions (e.g.. 



Durham 



20071: 



Silva et al. 



2006 : 



Nakaiima and Omori 



2012h to multivariate analysis; time-varying covariance components are incorporated based on 



Smith and Kohn 



the Choles ky decomposition of volatility matrices, which is increasingly used in time series anal 
ysis (e.g. Jpinheiro and Bates 1996; 



2OO2I: iLopes et al. 



2012). A salient feature 



is that prior works on a developed Bayesian approach allow for parallel computation of condi- 
tional posteriors, which enables the new model to easily scale up to higher dimensions. 

Further, the new model includes a structure of skew selection for the multivariate series. 
Bayesian sparsity modeling has become a p opular meth od to explore parsimonious models in 



a wide range of statistical ana lysis (see e.g.,IWestll2003h. Standard sparsity priors for vari able 



selection in regression models dGeorge and McCullochlll993 , 1997 : Clyde and Georgell2004 ) are 



employed for selecting zero or non-zero skewness parameter in the GH skew t-distribution for 



each series. As a related work. 



Panagiotelis and SmithI (120101) consider the sparsity prior on a 



coefficient of skew in a multivariate skew ^-distribution. In the current paper, the sparsity prior is 
assumed for the skewness parameter in the GH skew t-distribution. Empirical studies using time 
series of stock returns show that the skewness selection, in addition to the dynamic correlated 
structure, reduces uncertainty of parameters and improves forecasting ability. 

Section 2 defines the new MSV models with the GH skew t-distribution. Section 3 discusses 
Bayesian analysis and computation for model fitting. An illustrative example in Section 4 uses 
a time series of S&P500 Sector Indices to provide detailed evaluation of the proposed models 
with comparisons to standard MSV models. Section 5 presents a higher dimensional study of 
world-wide stock price indices to demonstrate the practical utility of the approach. Section 6 
provides some summary comments. 
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2 Multivariate stochastic volatility and skew distribution 

This section first introduces the GH skew t-distribution in a univariate case in Section I2.1[ 
and then defines the new class of MSV models with the skew distribution in Section [Z2l 

2.1 GH skew t-distribution 

Suppose a univariate time series, {wit,t = 1,2,...}, follows the GH skew t-distribution that 
can be written in the form of normal variance-mean mixture as 



Wit 



m + /3iZit + ^/z^t£it, 



(1) 



with £it ~ A^(0, 1), and za ~ IG{ui/2, Ui/2), where IG denotes the inverse gamma distribution. 
The previous studies often assume that = —fiiCi, where q = ^{za) = Vijiyi — 2), for ^{wit) = 
0, and z/j > 4 for the finite variance of wu, which is taken here. This is a special case of a more 
gen eral class of th e GH di stribution (see e.g., Uas and Haflbood) . As discussed by lPrausd (119990 
and lAas and Hafn (120061) . the parameters of the general GH distribution are typically difficult 



to jointly estimate. Therefore, the current paper uses the GH skew i-distribution in the form of 
eqn. ([!]), which includes necessar y parameters enough to de scribe the skew and heavy- tails of 



the financial return distributions (iNakaiima and Omorill2012 l) . 

A key structure of the class of GH distributions is that the random variable is represented by 
the normal variance-mean mixture: a linear combination of two random variables that follow 
standard normal distribution, and the generalized inverse Gaussian (GIG) distribution, a more 
general class of the inverse gamma distribution taken for the GH skew t-distribution. The com- 
bination of a mixing weight /3i in eqn. ([1]), called an asymmetric parameter, and the scale param- 
et er Uj determines the s kew ness and heavy-tailedn e ss of t he resulting distribution. As illustrated 
by lAas and Hafn (l2006b and iNakaiima and Omoril (l2012h . the /3i represents the degree of skew 
with Ui fixed, and the Ui represents the degree of hea yy-tails with fixed. There are other def- 



initions for the skew t-distributions in the literature (IHansen 



Prause 



199 



i 



Jones and Faddy 



2003 



Azzalini and Capitanio 



1994; 



2OO3I) . However, the GH skew t- 



Fernandez and Steel 



1998 



distribution defined above has a great advantage in consonance with Bayesian modeling of 
latent variables. The representation of the normal variance-mean mixture leads to an efficient 
computation with conditional samplers for the latent variables in model fitting using Markov 
chain Monte Carlo (MCMC) methods, described in Section [3l 

2.2 Cholesky multivariate stochastic volatility 

Define a k x 1 vector response time series = {yu, . . . -.ykt)' , {t = 1,2, . . .). A standard 
Cholesky MSV model defines t/j ~ N{0, St) with the triangular reduction AtTitA'^ = A^, where 
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At is the lower triangular matrix of covariance components with unit diagonal elements and At 
is diagonal with positive structural variance elements: viz. 



/ 1 

-021,t '■• 
\ — flfcl,* • • • 



\ 







-ak,k-i,t 1 / 



/ Alt 


V 



••• \ 



■•■ 

Xkt J 



This implies Sl^ = A^ ^A^(Aj) ^, and = A^ ^AtCt, where et ~ N{0,I). The construction 
of this Cholesky decomposition has appeared in previous works for constant covariance ma 



trices (IPinheiro and Bated 1 19961: 



Pourahmadi 



19991: 



Smith and Kohn 



and dynamic covariance modeling with stochastic volatility models (ICogley and Sargent 



2002 



George et al 



Primiceri 



2005 



2008) 



2005 



Lopes et al.ll2012h . A salient feature in Bayesian modeling of Cholesky MSV 



models for time-varying parameters of the covariance/variance elements is that the approach 
reduces the multivariate dy namics to un i variate volatility processes that form a state space rep- 
resentation, as discussed by lLopes et alj (120 12h . The new idea exploits the Cholesky structure 
for modeling MSV and embeds the GH skew i-distribution as follows. 
The new class of models is defined by 

Vt = A^^KtWt, 

where wt = {wu, ■ ■ ■ ,Wkt)' is the k x 1 vector whose element wu independently follows the 
GH skew i-distribution defined by eqn. ([1]). Define ht = {hu, . . . , h^t)' as the A; x 1 vector of 
stochastic volatility in Aj with hu = log(A?^), for i = 1, . . . , /c, and at = (ait, • • • , ^pt)' as the p x 1 
{p = k{k — l)/2) vector of the strictly lower-triangular elements of At (stacked by rows). The 
time-varying processes for these Cholesky parameters are specified as 



and 



ht+i 
at+i 

I st\ 
Vt 



fi + ^{ht - fj,) + Tjt, 

Ma + *a(at - Ma) +^t, 



(2) 



N 



I Q 
Q S 

0V 
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where et = {en, . . . jSkt)' , and each of {^,^a, S,Q,V) is assumed diagonah $ = diag( 



< 1, 



*a = diag((/)ai), S = diag(crf), Q = diag(pio-i), and V = diag(t;^j), with \<pi\ < 1 and \(t>aj 
for each i = 1,. . . ,k, and j = 1, . . . ,p. Thus all univariate time-varying parameters follow 
stationary AR(1) processes. The identity = Aty^ = AfWt lead s to a set of univariate stochastic 



volatilities with the GH skew i-distribution (INakajima and Qmori 



2012 ) 




{/3i{zit - Ci) + ^/z^teu}exp{hit/2), 
Hi + ct>i{hit - Hi) + r]it, 



N{0,fti), and fti 



i PiCTi 

PiCTi erf 



(3) 
(4) 

(5) 



where {ya, pi,<pi,r]it) are the i-th (diagonal) elements of {y^, fi,^,r]f), respectively, for i 
1,. . . ,k. The Pi measures the correlation betw een e, t ^ and mt, which is typically negative for 
stock returns as the so-called leverage effect (IYuI|2005 



Qmori et al. 



20070. The class of univari- 



ate stochastic vo 



Kim et al. 



19981: 



atility models has been we 



Ghvsels et al. 



20021: 



Eraker 



1 stud i ed in the literature fe.g.Jjacquier et a. 



2004 



In the context of MSV modeling (IChib et allbood: ksai et al.ll200Q: IGourieroux et al 



Shephardll200 5: Nak aiima and Qmori 



2004; 



2009). 



2009h . 



the proposed model here is a natural extension of the univariate stochastic volatility model with 
the GH skew t-distributions embedded in the Cholesky-type multivariate structure. Most of the 
multivariate skew distributions and their extension to volatility models in the previous literature 
often have the difficulty of scaling up in dimension of responses in terms of computation of 
model likelihoods and parameter estimates. In contrast, an inference of the new model reduces 
to that of simply k univariate stochastic volatility models; this leads an efficient and fast parallel 
computation under conditionally independent priors as specified below. 

2.3 Skew selection 



As mentioned by IPrimiceril (120051) . it is not straightforward to theoretically explore com- 
pounded processes of covariance/v ariance elements in the Cholesky-type covariance matrix. 
(See Appendix B of INakajima (120121) for characteristics of the resulting covariance matrix pro- 
cess.) To understand the skew in the Cholesky MSV models, a simulation study follows. 

A sample of size T = 1 ,000 and A: = 5 is simulated according to the proposed MSV model with 
fixed parameter values: (pi = 0.995, ai = 0.05, pi = —0.5, pi = —9, Ui = 20, and aj = 0.5, for all 
i, j. These values are selected following empirical studies in previous works. The value of a/s set 
here implies correlations between responses around 0.3-0.8. For the skewness parameter, four 
sets of values are considered: /3 = . . . , = (i) Oixs, (ii) — lix5> (iii) (— — — 0, 0), 



5 



(i) P = (0, 0, 0, 0, 0) (ii) p = (-1 , -1 , -1 , -1 , -1 ) 




12345 12345 



(iii) P = (-1 , -1 , -1 , 0, 0) (iv) p = (0, 0, -1,-1,-1) 




12345 12345 

Figure 1: Skewness of simulated data: means (solid line), 50% (filled area, dark) and 80% 
(light) intervals for 1,000 sets of simulated series. The horizontal axis refers to the series index 
i. 



and (iv) (0, 0, —1, —1, —1). Figure [T] shows summaries of skewness of simulated data from 1,000 
sets of simulation. The cases (i) and (ii) clearly exhibit no skewness and significant skewness, 
respectively. Interestingly, the case (iii) still yields skew observations including in the last two 
series (i = 4, 5) despite the zero skewness parameters. This is because the latter series inherit 
the former structur al processe s due t o the lower triangular structure of Cholesky components 
(see Appendix B of iNakaiimal (l2012h ). The case (iv) confirms this mechanism; the first two 
series do not exhibit skewness because the corresponding skewness parameters are zero, and no 
inherited structure arises. 

From these findings, the skewness parameter /3j's can be redundant for the latter series in 
the response vector y^. Shrinkage to zero of subsets of the skewness parameters addresses 
skew selection in the Cholesky MSV model, exploring more parsimonious structure to reduce 
estimation uncertaint y and improve predictions. A traditional sparsity prior for varia ble selection 



in regression models (IGeorge and McCuUoch 



1993 . 



19971: 



Clyde and Georgell2004) is employed 



for the skew selection. Specifically, the sparsity prior for has the form 



A ~ AciV(A|0,r2) + (l-K)5o(ft), 



(6) 
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for i = 1, . . . ,k, where 5o denotes the Dirac delta function at zero. This prior assigns the proba- 
bihty K of taking a non-zero value and the shrinkage probability 1 — k with a point mass at zero. 
Due to the structure of the normal-mean variance mixture and the conditional independence of 
univariate stochastic volatility processes, a conditional sampler for /3j under the sparsity prior is 
quite easy and simple as described in the next section. 

3 Bayesian analysis and computation 

Model fitting using the MCMC methods includes con d itional s amplers for univariate s tochas - 
tic volatility models with le verage effect (lOmori et al.l (120070 : lOmori and Watanabd (l2008b : 



Prado and West 



Nakaiima and Omoril (120 12f) ') and for the state space dynamic models (e.g., 
20101) . Based on observations y^^.j. = {yi, . . . , y^} over a given time period of T intervals, the 
full set of latent process state parameters and model parameters in the posterior analysis are 
listed as follows: 

• The stochastic volatility processes /ij,i:T and mixing latent processes zt^i-x, {i = 1, ■ ■ ■ ,k); 

• The covariance component process states ai-^T', 

• The skewness parameters f3 and the sparsity hyper-parameter k; 

• H5^er-parameters defining each of the univariate stochastic volatility processes, 0, = 

{fii, (f)i,ai,pi, Ui}, {i = 1, . . . ,k); 



• Hyper-parameters defining each of the covariance component processes, 6aj = {fJ.. 

Vaj}, (i = 1, . . . ,p). 

Components of the MCMC computations are outlined as follows. 

Stochastic volatility processes and mixing latent processes: The conditional posteriors for 
each of the latent volatility processes /ii,i:T, = 1, • • • , k) are sanipled using the MCMC tech 



nique f or the stochastic volati l ity mo dels with leverage (lOmori et al.ll2007 ; 



2008). 



Omori and Watanabe 



Nakaiima and Omoril (120121) implement the algorithm for the stochastic volatility with 
the GH skew i-distribution. Including the mixing latent process, Zi^i-x, (i = I, . . . ,k), these state 
processes are conditionally independent across i in the posteriors given all other latent vari- 
ables and hyper-parameters, which allows parallel generation of the volatility processes based 
on eqns. (©-([S]). 

Covariance component process states: Conditional on other latent process states and hyper- 
parameters, the MSV model reduces to a conditionally linear, Gaussian dynamic model for the 
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states ai-T- Specifically, 



yu = af^Xit + Sit, iit^ N {Q,Zit{l- pi)) , 

where yn = yite~^'''^-pi{zit-Ci)-^/zrtPim, fin = {hi^t+i-Pi)-(t>i{hit-Pi), xu = (yite"''"/^, . . . , 
te~^'"i */^)', and a|*^ denotes the 1 x (i — 1) vector of the free parameters in the i-th row of 
At, for i = 2, . . . ,k. This observation equation is coupled with the state evolution of eqn. ([2]); 
sampling full sets of the states is implemented using the stan dard forward filtering, backward 



sampling (FFBS) algorithm fe-g- Jde Jong and Shephard 



19951) 



Skewness parameters: Conditional on all the latent states and hyper-parameters, under the 
prior defined by eqn. ([S]), the posterior for the skewness parameter /3j is given by 

A I • ~ AtiiV(A|A,f2) + (1 - k,)So{Pi), 

where /3j and f?' are the posterior mean and variance of the posterior distribution for /3j under 
the normal prior A^(0, Tq); and ki = Kbi/{Kbi + 1 — k) with bi = exp{(3f /2fl)fi/TQ. For the 
parameter k, a beta prior is assumed; then we directly sample the conditional posterior given 
the number of /3j's such that /3j / 0. 

Stochastic volatility hyper-parameters: For each i = I, . . . ,k, traditional forms of priors for 
AR model parameters are assumed: normal priors for //j, shifted beta priors for each of (<pi, pi), 
inverse gamma priors for af, and truncated gamma priors for i/j {ui > 4), with prior indepen- 
dence across i. Conditional posteriors, given the other state varia bles and hyper-pararneters 



2012 . 



can be sampled directly or via Metropolis-Hastings algorithms. fSee lNakaiima and Omori 
Section 2.2). 

AR hyper-parameters Oaj- For each j = 1, . . . ,p, the same forms of priors are assumed for 
{fJ'aj,<Paj,Vaj) as {pi, (f)i,af). Conditional posteriors given the states ai-x can be sampled directly 
or via Metropolis-Hastings algorithms. 

Note that sampling each of (/ij,i:T, -Zj.iiT, cLi\, A, Oi) can be parallelized across i. In prelimi- 
nary simulation studies and the following empirical examples, MCMC streams were fairly clean 
and stable with quickly decaying sample autocorrelations in the same manner as the univariate 
stochastic volatility models. 

4 A study of stock price index 

The first study applies the proposed model to a series oik = b daily stock returns. An analysis 
particularly focuses on how the multivariate correlation mechanism and skew components re- 
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1 INDU Industrials 

2 CONS Consumer Staples 

3 FINL Financials 

4 ENRS Energy 

5 INFT Information Technology 

Table 1: S&P500 Sector Index. Sectors are ordered by smaller posterior means of the skewness 
parameter obtained from univariate stochastic volatility models with the skew t-distribution. 



veal dynamic relationships underlying the stock return volatilities and improve forecasting abil- 
ity. Note some connections w ith previous work on multivariate stock return time se r ies using dy- 
namic volatility models (e 



2011 



Ishihara and Omori 



Aas et al 



20061: 



Chib et al 



20061: 



Conrad et al 



2011 



Zhang et al 



20121) . 

4. 1 Data and model setup 

The data are S&P500 Sector Indices over a time period of 1,510 business days beginning in 
January 2006 and ending in December 2011. The returns are computed as the log difference of 
the daily closing price. The series are listed in Table [1] The ordering of the series in the vector of 
response matters due to the structure based on the Cholesky decomposition. From simulation 
results in Section [Z3l the series are ordered by smaller posterior means of the skewness parame- 
ter Pi obtained from the univariate stochastic volatility models with the skew t-distribution. This 
strategy induces more parsimonious skew structure, which improves forecasting performance as 
discussed below. 

The following priors are used: (0=,, + l)/2 ~ B{20, 1.5) for each * G {i,ai}, fXi ~ A^(— 10, 1), 
^^ai ~ N{0,1), ^ G(20,0.01), v'^^ ~ G(20,0.01), {pi + l)/2 ~ B{1,1), Vi ~ G(16, 0.8)/[i^i > 
4], /3j ~ KiV(/3j|0, 10) + (1 - K)5o(ft)j and k ~ 5(2,2), where B and G denotes the beta and 
gamma distributions, respectively. The MCMC analysis was run for a burn-in period of 5,000 
samples prior to saving the following 50,000 samples for posterior inferences. 

The study provides forecasting performance in comparison among different specifications in 
the proposed class of models. The following five models are considered: 

• S: Skew i-distribution, no sparsity on /3j (k = 1), no correlation (A^ = I); 

• SS: Skew t-distribution with sparsity on no correlation; 

• C: Symmetric t-distribution = 0), with correlation; 

• CS: Skew t-distribution, no sparsity on /3j, with correlation; 

• CSS: Skew t-distribution with sparsity on and correlation. 
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The key focus here is on the skew in return distribution, sparsity structure on the skewness 
parameter, and the Cholesky-type correlation mechanism in the MSV. 

4.2 Forecasting performance and comparisons 

Out-of-sample forecast performance is examined to compare the competing models in pre- 
dicting 1 to 5 business days ahead. Forecasts are based on a posterior predictive density sampled 
every MCMC iteration. An experiment is implemented in a traditional recursive forecasting for- 
mat; the full MCMC analysis is fit to each model to obtain the 5 -horizon forecasts given data 
from the start of January 2006 up to business day Tj with Tj = Tj_i + 5. Specifically, each model 
is first estimated based on data Vi-t^ where Ti = 1,010. The resulting out-of-sample predictive 
distributions are simulated over the following 5 business days, t = Ti + 1, . . . , Ti + 5. Next, the 
analysis moves ahead 5 business days to observe the next 5 observations Z/ri+i:Ti+5 ^^'^ reruns 
the MCMC based on the updated data yi-j-^' where T2 = Ti + 5, forecasting the following 5 
business days t = T2 + I, . . . ,T2 + 5. This is repeated with Tj = Ti^i + 5 for « = 2, . . . , 100, 
generating a series of out-of-sample forecasts over 500 business days. This experiment allows 
us to explore forecasting performance over nearly 2-year periods of different financial market 
circumstances and so examine robustness to time periods of the prediction ability. 

The first measure of formal model assessments is out-of-sample predictive densities. The 
log predictive density ratio (LPDR) for forecasting d business days ahead from the day t is 
LPDRt(d) = log{pM^{yt+d\yi:t)/PMo{yt+d\yi:t)}' where pA/(yt+dl?/i:i) is the predictive density 
under model M. This quantity represents relative forecasting accuracy in the prediction exer- 
cise. Table [2] reports the LPDRs of four competing models relative to Model S at each horizon. 
Improvements in out-of-sample predictions are practically evident for the proposed multivari- 
ate skew models. The LPDRs for Models C and CS show relevance of correlated structure, and 
differences in those for Models C and CS indicate dominance of the skew component in the 
multivariate stock returns. The LPDRs for Model SS and comparisons in those for Models CS 
and CSS show that the sparsity on the skew parameters contributes to improved predictions, 
robustly across horizons. The LPDRs for Models CS and CSS at the 2nd and 4th horizons are 







Horizon id days) 






Model 


1 


2 3 4 


5 


Total 


SS 


7.0 


13.2 11.2 16.7 


15.4 


63.5 


C 


34.0 


54.7 45.5 52.8 


50.9 


237.9 


CS 


88.5 


216.8 85.2 240.2 


144.5 


775.1 


CSS 


92.8 


230.6 91.5 259.7 


156.9 


831.4 



Table 2: Cumulative log predictive density ratios LPDRt(d) relative to Model S. 
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relatively inflated, which is due to two time points under market shocks related to the European 
sovereign-debt crisis. In turbulent situations, the skew and correlated structures yield substan- 
tially increased improvements. Even if these two times are removed from the full period of 
comparison, the models still show relevant dominance over the standard MSV models. 

The second measure of the formal model comparisons is based on Value-at-Risk (VaR) fore- 
casts of portfolio returns. Using samples from the posterior predictive distribution, optimal port- 
folios are implemented under several allocation rules, and the VaR foreca st of the re sulting port- 
folio is obtained at each time Tj + 1, . . . , Tj + 5, for i = 1 . . . , 100. Note that lsauwens and Laurent 



(l2005b illustrate a similar procedure in evaluation of the VaR forecasts. A main focus here is on 
an impact of the proposed multivariate skew model on forecasting accuracy, in particular for a 
tail risk of multivariate responses. 



The analysis uses standard Bayesian mean-variance optimization (iMarkowitzi 1 1 95 9f) . Based 
on the samples from the posterior predictive distribution, the forecast mean vector and vari- 
ance matrix of y^, denoted by and Dt respectively, are computed. Investments are allocated 
according to a vector of portfolio weights, denoted by ujt, optimized by the following alloca- 
tion rule. The realized portfolio return at time t is uj'tyt. Given a (scalar) return target m, we 
optimize the portfolio weights ujt by minimizing the forecast variance of the portfolio return 
among the restricted portfolios whose expectation is equal to m. Specifically, we minimize an 
ex-ante portfolio variance u'^DtuJt, subject to u:[g^ = m, and uj[l = 1, i.e., the total sum in- 
vested on each business day is fixed. The solution is cj^*"-* = Kt{l' KtqtQt — g't^tQt^), where 
Qt = (Im - gt)/dt, and dt = {V Ktl){g'tKtgt) - {I'Ktgtf, with Kt = D^K The study also 
considers the target-free minimum- variance portfolio given by = Ktl/{1' Ktl). The port- 
folio is reallocated on each business day based on 1- to 5- business day ahead forecasts. This 
experiment assumes a practical situation that investors allocate their resource every business 
day based on weekly-updated forecasts. Note that the resources are assumed freely reallocated 
to arbitrary long or short positions without any transaction cost. 

In summary of the VAR forecasts, the number of VaR violations, denoted by n, is counted 
over N = 500 experiment days. The expected number of violations for a quantile is aN; under 
the null hypothesis that the expected ratio of violations is equal to a, the likelihood ratio statistic, 



is asymptotically distributed as x^(l) (see lKupiedll995r) . Table [3] reports the number of VaR 



violations and results of the likelihood ratio test for a = 0.5%, 1%, and 5% levels, based on 
a range of daily target returns of m = 0.005%, 0.01%, and 0.02%, implying a yearly return of 
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(1) Violations 
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1 OA 


43 


32 


i 1 






5% 


69 


60 


41 


79 


C 


0.5% 


30 


22 


12 


45 




1% 


39 


29 


18 


52 




5% 


64 


48 


33 


82 


CS 


0.5% 


2 


2 


2 


5 




1% 


5 


7 


3 


8 




5% 


18 


19 


17 


25 


CSS 


0.5% 


3 


2 


1 


4 




1% 


6 


5 


2 


6 




5% 


24 


26 


22 


24 



(2) p-values 



Target return (m) Target 



Model 


a 


0.005% 


0.01% 


0.02% 


-free 


S 


0.5% 


0.00 


0.00 


0.00 


0.00 




1% 


0.00 


0.00 


0.01 


0.00 




5% 


0.00 


0.00 


0.32 


0.00 


SS 


0.5% 


0.00 


0.00 


0.02 


0.00 




1% 


0.00 


0.00 


0.02 


0.00 




5% 


0.00 


0.00 


0.00 


0.00 


C 


0.5% 


0.00 


0.00 


0.00 


0.00 




1% 


0.00 


0.00 


0.00 


0.00 




5% 


0.00 


0.00 


0.12 


0.00 


CS 


0.5% 


0.74 


0.74 


0.74 


0.16 




1% 


1.00 


0.40 


0.33 


0.22 




5% 


0.13 


0.20 


0.08 


1.00 


CSS 


0.5% 


0.76 


0.74 


0.28 


0.38 




1% 


0.66 


1.00 


0.13 


0.66 




5% 


0.84 


0.84 


0.53 


0.84 



Table 3: VaR results: the number of violations and p-values for the likelihood ratio test. The null 
h5^othesis is that the expected ratio of violations is equal to a. 
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approximately 1.25%, 2.5%, and 5.0% respectively, as well as the target-free portfolio. For a 5% 
significance level, the null hypothesis is rejected in almost all cases of VaR quantiles and portfolio 
rules for Models S, SS, and C. The large number of their VaR violations indicate that these 
models forecast smaller values of VaR (in their absolute value) than necessary. This optimistic 
risk forecast is due to lack of structure including both skewness and correlations. In contrast, 
the null hypothesis is not rejected in all cases for Models CS and CSS except in only one case, 
a = 5% with m = 0.02% for Model CS. 

The results from the out-of-sample forecasting experiments reveal that the skew and corre- 
lated multivariate structure contributes to the forecasting performance in terms of the predictive 
density and the VaR risk analysis. In particular, the sparse skew model with correlation, Model 
CSS, achieved the best posterior predictive densities and passed the VaR likelihood ratio test 
for all the realistic situations. These findings are similar to those obtained from different prior 
densities; a prior sensitivity analysis is provided in the Appendix. 

Regarding the ordering of the response in t/^, Bayesian prior works on the orderings can 



be considered (see Nakajima and Watanabd I2OIIL for reversible-jump MCMC methods in the 



Cholesky MSV models), although this is beyond the scope of this paper. Instead, the reverse 
ordering of the responses is examined here to compare with the current baseline ordering. Fore- 
casting performances are computed for Model CSS; results show weaker forecasting ability of 
the reverse ordering than the baseline ordering in terms of the predictive density and VaR fore- 
casts. This confirms that the baseline ordering based on the posterior means of /3j's has an 
advantage over the reverse ordering. 

4.3 Summaries of posterior inferences 

Posterior estimates are summarized for results of the MSV models fit to data y^.j. with 
T = Ti = 1,010. Figure [2] displays posteriors of model parameters for Model CSS. One re- 
markable finding is that the posterior for /3i is estimated negative considerably apart from zero, 
although the posteriors for other /3j's (i = 2, . . . , 5) exhibit shrinkage at zero; their posterior 
probabilities of shrinkage are about 91-94%. This parsimonious skew structure evidently im- 
proves forecasting ability compared to the non-sparsity model as reported in the previous sub- 
section. Figure [3] plots the posterior estimates of /3j for four competing models. For Models S the 
/3j's are estimated negative with reported credible intervals that are mostly apart from zero. For 
Model SS, moderate shrinkages are found for ifi^iPi), and considerable shrinkage is observed 
for In contrast, Model CS exhibits credible intervals including zero except for and the 
evident shrinkages in Model CSS yield the parsimonious skew structure. 

The posteriors for the other parameters in Figure [2] are consistent with previous studies. The 
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Figure 2: Posterior estimates for parameters from Model CSS: Posterior medians (solid line) 
and 50% (filled area, dark) and 80% (light) credible intervals. The horizontal axis refers to the 
series index i. 
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Figure 3: Posterior estimates for Pi'. Posterior medians (solid line) and 50% (filled area, dark) 
and 80% (light) credible intervals. The horizontal axis refers to the series index i. 
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Figure 4: Posterior means of the selected standard deviation (top, CONS) and correlation (bot- 
tom, INDU-CONS) in 

posterior medians of pj's are estimated negative, indicating the leverage effect for stock return 
dynamics. One possible extension from the current model is sparsity for p^'s, although a result 
from Model CSS with the same sparsity prior embedded to p^'s showed only little evidence of 
sparsity for all i. 

Figure [4] graphs trajectories of posterior means of a selected standard deviation and correla- 
tion, denoted by Vi and rjj respectively, in the resulting covariance matrix T^t = Aj^ K^{A[)^^ . 
Note that the figure shows only a part of sample periods for visual clarity. The top panel shows 
the standard deviation of CONS {i = 2) from three MSV models. Model C yields higher standard 
deviations than the other skewed models due to the symmetric t-distribution that estimates the 
left tail lighter than the skew models. Model CSS yields higher standard deviations than Model 
CS because of the shrinkage toward zero. These differences tend to be larger in high-volatility 
periods. The same feature is found in the correlations; the bottle panel shows the correlation 
between INDU {i = 1) and CONS. Model CS yields less correlated structure due to its skew 
error distribution. Meanwhile, across the series and sample periods, the correlation is evidently 
time-varying for the stock return data, which results in the contribution of the Cholesky-based 
time-varying correlation structure to the improved prediction ability. 

Further, Figure [5] shows approximated posterior joint predictive densities of (2/i,t+i, y2,T+i) 
and (y2,T+i, ?/3,T+i) in surface plots with tail behaviors displayed in scatter plots. Compared to 
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Figure 5: Surface plots for smoothed joint predictive density for (yi,T+i,y2,T+i) (top) and 
(y2,T+i, y3,T+i) (bottom) based on MCMC samples. Scatter plots are displayed for tail sam- 
ples, defined as samples in regions where the smoothed density is less than 1% of the maximum 
density. 



Model SS, the correlated MSV models (C and CSS) exhibit a clear image of correlated predictive 
densities. Model CSS yields more tail samples in the left tails due to the negative skewness. 
These differences result in the large improvement of VaR forecasts illustrated in the previous 
subsection. 

5 A higher-dimensional study: World-wide stock price indices 

This section provides a higher-dimensional example for the skew and correlated MSV models 
using k = 20 world-wide stock price indices (see the list of countries and regions in Table |4]) . 
These are selected as major indices traded in the global financial market; note that both the 
Euro and several European countries are included, although their time series do not exhibit 
severely high correlations. The time period is T = 1,258 business days beginning in January 
2006 and ending in December 2010. The returns are computed as the log difference of prices 
at the closing time of the US market. The variables in are ordered by posterior means of the 
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1 


Euro 


11 


Brazil 


2 


US 


12 


Spain 


3 


India 


13 


Russia 


4 


Taiwan 


14 


Swiss 


5 


Netherlands 


15 


Hong Kong 


6 


Japan 


16 


UK 


7 


Mexico 


17 


Australia 


8 


Sweden 


18 


Germany 


9 


France 


19 


Canada 


10 


Italy 


20 


Korea 



Table 4: World-wide stock price indices. Countries are ordered by smaller posterior means of 
the skewness parameter /3j obtained from univariate stochastic volatility models with the skew 
t-distribution. 

skewness parameter /3j obtained from the same pre-analysis, and the study uses the same prior 
specifications as in the previous section. 

Figure [6] shows posteriors of the model parameters for Model CSS. A remarkable evidence 
is considerable shrinkage of /3j's except for the first two series, suggesting much parsimonious 
structure induced by the skew selection. Figure [7] plots posteriors of /3j for Models S and CS. 
Note that the series are ordered by posterior means of /3j's based on Model S, although the pos- 
terior medians displayed here are not monotonically increasing. Model CS exhibits interesting 
estimates; a posterior distribution of /Js leans to positive, presumably for adjusting the skewness 
in connection with the former series on the Cholesky-type compound processes. This finding and 
the evidence of shrinkage in Model CSS suggest that the skewness parameter can be redundant 
in the correlated MSV models. 

Other parameters in Figure [6] show some differences in behaviors of stock price indices 
among the countries. The series of Spain (i = 12) and Russia {i = 13) exhibit smaller (/)j's 
and larger fj/s, implying less persistent volatility dynamics. The series of EUR (i = 1) and US 
(i = 2) show large leverage effects with posterior medians of pi below —0.5. An important ad- 
vantage here is that the proposed Cholesky MSV models easily scales up in its dimension with 
the reduction of posterior computation to univariate stochastic volatility analysis. 
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Figure 6: Posterior estimates for parameters from Model CSS: Posterior medians (solid line) 
and 50% (filled area, dark) and 80% (light) credible intervals. The horizontal axis refers to the 
series index i. 




Figure 7: Posterior estimates for /3j: Posterior medians (solid line) and 50% (filled area, dark) 
and 80% (light) credible intervals. The horizontal axis refers to the series index i. 
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6 Concluding remarks 

A new framework of building correlated multivariate stochastic volatility models with skew 
distributions is developed. The approach of Cholesky-type covariance structure effectively em- 
beds the univariate stochastic volatility with leverage effects and the GH skew ^-distributions 
to the multivariate analysis. The salient feature of the proposed model is the skew selection 
based on the sparsity prior on the skewness parameters. In stock return analyses, the empirical 
evidence shows the sparse skew and dynamic correlated structures contribute to improved pre- 
diction ability in terms of the predictive density and portfolio VaR forecasts, which is practically 
relevant to business and policy uses of such models in investment and risk management. 

There are a number of methodological and computational areas for further investigation. 
In terms of modeling strategy, the sparse skew structure can be a pplied to factor stochas 
tic volatility models, which have been widely studied in 



Pitt and Shephard 



199 



Aguilar and West 



2000 : 



Chibetal 



iterature (IGeweke and Zhou 



1996 : 



200q). Also, the t i me-vary ing spar 



sity technique using latent threshold models proposed by lNakaiima and WestI (l2012aUbf) can be 
employed to explore more parsimonious covariance structure for the skew MSV models. One 
important open question is a po tential computational strategy of sequential particle learning 
algorithms dCarvalho et al.ll2010l) for the proposed MSV models, which would be useful in real- 
time decision making context. 
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Appendix. Prior sensitivity analysis 

A forecasting study with different prior distributions is examined for the S&P500 Sector 
Indices data used in Section [4l Consider the following priors: (Prior-1) k ~ B{2,8), (Prior- 
2) /3i ~ KN{^i\ - 1,2) + (1 " K)(5o(/3i), and (Prior-3) Ui ~ G(24, 0.6)/[z^i > 4]. All the other 
parameters remain the same as the baseline priors specified in Section \47\] Compared to the 
baseline priors, the new priors imply more concentrated densities of k (Prior-1) with posterior 
mean 0.2, shifted from 0.5 in the baseline, /3j (Prior-2) with posterior mean —1 from 0, and Ui 
(Prior-3) with posterior mean 40 from 20. 

Table [5] reports the cumulative log predictive density ratios of Models CS and CSS relative to 
Model S, which shows little difference among the different priors. In addition, the VaR forecasts 
are also computed in the same way as in Section !?^ the likelihood ratio tests indicate the null 
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Horizon (d days) 



Model 


Prior 


1 


2 


3 


4 


5 


Total 


C 


(1) 


34.0 


54.7 


45.5 


52.8 


50.9 


237.9 




(2) 


34.6 


54.0 


44.1 


50.9 


50.2 


233.8 




(3) 


48.6 


63.1 


58.3 


63.6 


68.1 


301.7 


cs 


(1) 


88.5 


217.6 


84.7 


240.6 


143.9 


775.2 




(2) 


89.9 


223.5 


81.5 


241.5 


148.1 


784.6 




(3) 


101.6 


222.7 


97.0 


250.9 


162.6 


834.7 


CSS 


(1) 


96.8 


232.1 


94.3 


256.8 


158.6 


838.6 




(2) 


95.7 


227.1 


89.2 


246.5 


150.0 


808.4 




(3) 


117.2 


245.6 


116.3 


276.2 


181.6 


937.0 



Table 5: Prior sensitivity analysis: Cumulative log predictive density ratios LPDRj((i) relative to 
Model S. 

hypothesis that the expected ratio of violations is equal to a is not rejected for Models CS and 
CSS with the those priors in any case considered in Table [3] at the 5% significance level. These 
findings indicate that the results of forecasting performance improved by the skew and corre- 
lated structure in the MSV models are quite robust regardless of prior specifications for those 
key hyper-parameters. 

References 

Aas, K. and Haff, I. H. (2006), "The generalized hyperbolic skew Student's t-distribution," Jour- 
nal of Financial Econometrics, 4, 275-309. 

Aas, K., Haff, I. H., and Dimakos, X. K. (2006), "Risk estimation using the multivariate normal 
inverse Gaussian distribution," Journal of Risk, 8, 39-60. 

Aguilar, O. and West, M. (2000), "Bayesian dynamic factor models and portfolio allocation," 
Journal of Business and Economic Statistics, 18, 338-357. 

Arnold, B. C. and Beaver, R. J. (2000), "The skew-Cauchy distribution," Statistics & Probability 
Letters, 49, 285-290. 

Asai, M., McAleer, M., and Yu, J. (2006), "Multivariate stochastic volatility: A review," Econo- 
metric Reviews, 25, 145-175. 

Azzalini, A. (2005), "The skew-normal distribution and related multivariate families," Scandina- 
vian Journal of Statistics, 32, 159-188. 



20 



Azzalini, A. and Capitanio, A. (1999), "Statistical applications of the multivariate skew normal 
distribution," Journal of the Royal Statistical Society B, 61, 579-602. 

— (2003), "Distributions generated by pertubation of symmetry with emphasis on a multivariate 
skew t distribution," Journal of the Royal Statistical Society B, 65, 367-389. 

Azzalini, A. and Valle, A. D. (1996), "The multivariate skew-normal distribution," Biometrika, 
83, 715-726. 

Bauwens, L. and Laurent, S. (2005), "A new class of multivariate skew densities, with application 
to generalized autoregressive conditional heteroscedasticity models," Journal of Business & 
Economic Statistics, 23, 346-354. 

Bauwens, L., Laurent, S., and Rombouts, J. V. K. (2006), "Multivariate GARCH models: A sur- 
vey," JournaZ of Applied Econometrics, 21, 70-109. 

Branco, M. D. and Dey, D. K. (2001), "A general class of multivariate skew-elliptical distribu- 
tions," Journal of Multivariate Analysis, 79, 99-113. 

Cabral, C. R. B., Lachos, V. H., and Prates, M. O. (2012), "Multivariate mixture modeling using 
skew-normal independent distributions," Computational Statistics & Data Analysis, 56, 126- 
142. 

Carvalho, C. M., Johannes, M. S., Lopes, H. R, and Poison, N. G. (2010), "Particle learning and 
smoothing," Statistical Science, 25, 88-106. 

Chib, S., Nardari, F., and Shephard, N. (2006), "Analysis of high dimensional multivariate 
stochastic volatility models," JoumaZ of Econometrics, 134, 341-371. 

Clyde, M. and George, E. I. (2004), "Model uncertainty" Statistical Science, 19, 81-94. 

Cogley T. and Sargent, T. J. (2005), "Drifts and volatilities: Monetary policies and outcomes in 
the post WWII U.S." Review of Economic Dynamics, 8, 262-302. 

Conrad, C, Karanasos, M., and Zeng, N. (2011), "Multivariate fractionally integrated APARCH 
modeling of stock market volatility: A multi-country study," Journal of Empirical Finance, 18, 
147-159. 

Creal, D., Koopman, S. J., and Lucas, A. (2011), "A dynamic multivariate heavy-tailed model 
for time-varying volatilities and correlations," Journal of Business & Economic Statistics, 29, 
552-563. 



21 



de Jong, P. and Shephard, N. (1995), "The simulation smoother for time series models," 
Biometrika, 82, 339-350. 

Doornik, J. (2006), Ox: Object Oriented Matrix Programming, London: Timberlake Consultants 
Press. 

Durham, G. B. (2007), "SV mixture models with application to S&P 500 index returns," JoumaZ 
of Financial Economics, 85, 822-856. 

Eraker, B. (2004), "Do equity prices and volatility jump? Reconciling evidence from spot and 
option prices," Journal of Finance, 59, 1367-1403. 

Fernandez, C. and Steel, M. F. J. (1998), "On Bayesian modeling of fat tails and skewness," 
Journal of the American Statistical Association, 93, 359-371. 

George, E. I. and McCulloch, R. E. (1993), "Variable selection via Gibbs sampling," Journal of 
the American Statistical Association, 88, 881-889. 

— (1997), "Approaches for Bayesian variable selection," Statistica Sinica, 7, 339-373. 

George, E. I., Sun, D., and Ni, S. (2008), "Bayesian stochastic search for VAR model restrictions," 
Journal of Econometrics, 142, 553-580. 

Geweke, J. F. and Zhou, G. (1996), "Measuring the pricing error of the arbitrage pricing theory," 
Review of Financial Studies, 9, 557-587. 

Ghysels, E., Harvey, A. C, and Renault, E. (2002), "Stochastic volatility," in Statistical Methods 
in Finance, eds. Rao, C. R. and Maddala, G. S., Amsterdam: North-Holland, pp. 119-191. 

Gourieroux, C., Jasiak, J., and Sufana, R. (2009), "The Wishart autoregressive process of multi- 
variate stochastic volatiUty," Journal of Econometrics, 150, 167-181. 

Gupta, A. K., Gonzalez-Fan'as, G., and Dominguez-Molina, J. A. (2004), "A multivariate skew 
normal distribution," Journal of Multivariate Analysis, 89, 181-190. 

Hansen, B. E. (1994), "Autoregressive conditional density estimation," International Economic 
Review, 35, 705-730. 

Ishihara, T. and Omori, Y. (2012), "Efficient Bayesian estimation of a multivariate stochastic 
volatility model with cross leverage and heavy-tailed errors," Computational Statistics and 
Data Analysis, 56, 3674-3689. 



22 



Ishihara, T., Omori, Y., and Asai, M. (2011), "Matrix exponential stochastic volatility with cross 
leverage," Discussion paper series, CIRJE-F-812, Faculty of Economics, University of Tokyo. 

Jacquier, E., Poison, N., and Rossi, P. (2004), "Bayesian analysis of stochastic volatility with 
fat-tails and correlated errors," Journal of Econometrics, 122, 185-212. 

Jones, M. C. and Faddy, M. J. (2003), "A skew extension of the t-distribution, with application," 
Journal of Royal Statistical Society, Series B, 65, 159-174. 

Kim, S., Shephard, N., and Chib, S. (1998), "Stochastic volatility: Likelihood inference and 
comparison with ARCH models," Review of Economic Studies, 65, 361-393. 

Kupiec, P. (1995), "Techniques for verifying the accuracy of risk measurement models," Journal 
of Derivatives, 2, 173-184. 

Lopes, H. F., McCulloch, R. E., and Tsay, R. (2012), "Cholesky stochastic volatility models for 
high-dimensional time series," Tech. rep.. University of Chicago, Booth Business School. 

Markowitz, H. (1959), Portfolio Selection: Efficient Diversification of Investments, New York: John 
Wiley and Sons. 

Nakajima, J. (2012), "Bayesian Analysis of Latent Threshld Models," Ph.D. thesis, Duke Univer- 
sity, Durham, N.C. 

Nakajima, J. and Omori, Y. (2009), "Leverage, heavy-tails and correlated jumps in stochastic 
volatility models," Computational Statistics and Data Analysis, 53, 2535-2553. 

— (2012), "Stochastic volatility model with leverage and asymmetrically heavy-tailed error us- 
ing GH skew Student's i-distribution," Computational Statistics and Data Analysis, 56, 3690- 
3704. 

Nakajima, J. and Watanabe, T. (2011), "Bayesian analysis of time-varjdng parameter vector 
autoregressive model with the ordering of variables for the Japanese economy and monetary 
policy," Global COE Hi-Stat Discussion Paper Series 196, Hitotsubashi University. 

Nakajima, J. and West, M. (2012a), "Bayesian analysis of latent threshold dynamic models," 
Journal of Business and Economic Statistics, forthcoming. 

— (2012b), "D)mamic factor volatility modeling: A Bayesian latent threshold approach," Journal 
of Financial Econometrics, doi:10.1093/jjfinec/nbs013. 



23 



Omori, Y., Chib, S., Shephard, N., and Nakajima, J. (2007), "Stochastic volatility with leverage: 
fast likelihood inference," Journal of Econometrics, 140, 425-449. 

Omori, Y. and Watanabe, T. (2008), "Block sampler and posterior mode estimation for asymmet- 
ric stochastic volatility models," Computational Statistics and Data Analysis, 52, 2892-2910. 

Panagiotelis, A. and Smith, M. (2010), "Bayesian skew selection for multivariate models," Com- 
putational Statistics & Data Analysis, 54, 1824-1839. 

Pinheiro, J. C. and Bates, D. M. (1996), "Unconstrained parametrizations for variance- 
covariance matrices," Statistics and Computing, 6, 289-296. 

Pitt, M. and Shephard, N. (1999), "Time varying covariances: A factor stochastic volatility ap- 
proach (with discussion)," in Bayesian Statistics VI, eds. Bernardo, J. M., Berger, J. O., Dawid, 
A. P., and Smith, A. F. M., Oxford University Press, pp. 547-570. 

Pourahmadi, M. (1999), "Joint mean-covariance models with application to longitudinal data: 
Unconstrained parametrisation," Biometrika, 86, 677-690. 

Prado, R. and West, M. (2010), Time Series Modeling, Computation, and Inference, New York: 
Chapman & Hall/CRC. 

Prause, K. (1999), "The Generalized Hyperbolic models: Estimation, financial derivatives and 
risk measurement," PhD dissertation. University of Freiburg. 

Primiceri, G. E. (2005), "Time varying structural vector autoregressions and monetary policy," 
Review of Economic Studies, 72, 821-852. 

Sahu, S. K., Dey, D. K., and Branco, M. D. (2003), "A new class of multivariate skew distributions 
with applications to Bayesian regression models," Canadian Journal of Statistics, 31, 129-150. 

Shephard, N. (2005), Stochastic Volatility: Selected Readings, Oxford: Oxford University Press. 

Silva, R. S., Lopes, H. F., and Migon, H. S. (2006), "The extended generalized inverse Gaussian 
distribution for log-hnear and stochastic volatihty models," Brazilian Journal of Probability 
and Statistics, 20, 67-91. 

Smith, M. and Kohn, R. (2002), "Parsimonious covariance matrix estimation for longitudinal 
data," Journal of the American Statistical Association, 97, 1141-1153. 



24 



West, M. (2003), "Bayesian factor regression models in the "large p, small n" paradigm," in 
Bayesian Statistics 7, eds. Bernardo, J., Bayarri, M., Berger, J., David, A., Heckerman, D., 
Smith, A., and West, M., Oxford, pp. 723-732. 

Yu, J. (2005), "On leverage in a stochastic volatility model," Journal of Econometrics, 127, 165- 
178. 

Zhang, X., Creal, D., Koopman, S. J., and Lucas, A. (2011), "Modeling dynamic volatilities and 
correlations under skevmess and fat tails," Duisenberg School of Finance - Tinbergen Institute 
Discussion Paper, No.ll-078/2/DSF22. 



25 



